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CLAIMS 



DNA-binding molecule , capable of sequence specific binding 
to the minor groove of double -stranded DNA, characterised in 
that it comprises at least two sequence specific SNA-binding 
elements, covalently linked to each other in tandem 
•orientation by an amphipathic, flexible linker molecule, at 
least one of said DNA binding elements being non- 
proteinaceous . 

DNA-binding molecule according to claim 1 wherein at least 
one of the DNA-binding elements comprises an oligomer 
comprising one or more organic heterocyclic araino-acid 
residues. % 



DJJA-binding molecule according to claim 2 wherein each 
organic heterocyclic residue has at least one annular 
nitrogen, sulphur or oxygen. 

DNA-binding molecule according to claim 2 or 3 wherein said 
heterocyclic residue is chosen from pyrrole, imidazole, 
triable, pyrazole, furan, thiawle, thiophene, oxazole 
pyridine, or derivatives of any of 'these compounds wherein 
one or more of the heteroatome . are substituted by a 
aubstituent which is DNA-binding or non-DNA- binding _ 

D»A-binding mo lecule according to claim 4 wherein at least 
one oligomer includes heterocyclic residues chosen 11 " 
methylpyrrole %( py, and /or , . A C *° 8en from 

3-hydroxy N-methyl pyrrole , Rp) 
and / or N-methylimidazole dm) . W) 

»-b±adlng molecule according to anv 

urcner comprises at least 
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one aliphatic amino acid residue such as a p-alanine ((3) 
residue, or a 5-aminovaleric acid residue. 

DNA-binding molecule according to any one of claims 1 to 4, 
having the general formula (1) : 



(L) T 



(I) 



N 



CP 31 ] — W»h 



N 



wherein 

each of P 1 to P n represents a DNA-binding element, said 
element comprising multiple organic heterocyclic or 
aliphatic residues or fluorescent derivatives thereof ; 
each of R 1 to R fl represents a DNA-binding element, said 
element comprising multiple organic heterocyclic or 
aliphatic residues or fluorescent derivatives thereof ; 

- x represents an integer from 1 to 20 , with the proviso 
that when x is greater than 1, the multiple copies of 
[R"] , tL n l , [P»] and [T n ] may be the same or different ; 

- n represents an integer having a value equal to (x+l) ; 
[T] represents a multifunctional linking molecule 
providing a covalent link between DNA-binding elements 
[R] and TP], with the proviso that if « e w represents 0, 
[T X+1 J can be bifunctional j 

- each of a and c independently represent 0 or 1 ; 
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each of b and d independently represent o or 1, with the 
proviso that when a represents 0, b also represents 0, 
and when c represents 0, d also represents 0 ; 

[D] represents an end group or an effector moiety, 

[l>) m represents an amphipathic, flexible linker molecule, 
linking the DNA-binding elements in a tandem orientation 
with respect to each other ; 
♦ - m represents an integer from 1 to 10 ? 

[B] represents a spacer unit such as p- alanine ; 

[Z] represents an end group or an effector moiety ? 

each of f , g and e independently represent 0 or 1, 

each solid line represents a covalent bond ; 

N and c indicate the and c- terminal extremities of the 

molecule , respectively . 

DNA-binding molecule according to claim 7 wherein the DNA- 
binding elements P and R comprise heterocyclic residues 
chosen from pyrrole, imidazole, triazole, pyrazole, furan, 
thiazole, thiophene, oxazole, pyridine, or derivatives of any 
of these compounds wherein one or more of the heteroatoms is 
substituted. 



DNA-binding molecule according to claim 8 having the general 
formula (II) : 



<- ' J. 



x 



wherein [p 1 ] . fp* 1 ! / T x rril 

>< IP J, (L), Id] , m f x, m , f, a and e hav P ^ 
previously denned meanings * 

— a dotted line represents a covalent bond wnich M n b 
present or absent. wnicn can be 
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10 . iNA-binding molecule according to claim* 9 wherein each of the 
the DNA-binding elements [P J ] to [P 11 ] independently have the 
general formula (III) 

- {-U 1 - [U]« ] — (III) 



wherein : 

each U ia independently a monomeric unit chosen from a 

heterocyclic amino acid residue, or an aliphatic amino acid 

residue or a fluorescent derivative thereof, and 

o is an integer from 1 to 15, preferably from 2 to 8, 

and a dotted line represents a covalent bond which can be 

present or absent. 

11. DNA-binding molecule according to claim 10 wherein at least 
8 one U is chosen from N-methylpyrrole (Py) and /or 3 -hydroxy 

N-methylpyrrole (HP) and / or N-methyl imidazole (Im) . 

12. DNA-binding molecule according to claim 10 wherein at least 
one U is a p- alanine (£) residue, or a 5-aminovaleric acid 
residue . 

13. DNA-binding moJuecule according to claim 11 or 12 wherein S 
is an integer from 2 to 5 

14. DNA-binding molecule according to claim 13 wherein at least 
one of [P 1 ] to [p*] comprises between 3 to 5 heterocyclic 
amino acid residues. 



15. DNA-binding molecule accordin 9 to claim 13 wherein at leaet 
^ one of [P>] to [P»] comprises u.ore than two contiguous 
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heterocyclic amino acid residues, for example three, four or 
five contiguous heterocyclic amino acid residues. 

DNA-binding molecule according to claim 15 wherein stretches 
of three to five contiguous heterocyclic amino acid residues 
are separated from each other by a P-alanine residue 

DNA-binding molecule according to claim 10 wherein at least 
one of [P 1 ] to [p w ] has the formula (IV) 



N - C 
- pJJ ~ - [U,] - rU v ] - [U 5 ] - [U«] - [U ? ] - [U ft ] -~j (IV) 

wherein U is as previously defined, 
[U 4 ] is p-alanine, 

[UJ to [U 33 , and [U 5 ] to [U 7 ] are chosen from N-methylpyrrole 
(PyJ and / or N-methylimidazole (Ira), 
[U B ] may be present or absent, and if present is 
preferentially p- alanine, 

and a dotted line represents a covalent bond which can be 
present or absent. 



DNA-binding molecule according to claim 17 wherein [UJ to 
tU 3 ], and [U 5 ] to [U 7 ] are each N-raethylpyrrole (Py) . 

DNA-binding molecule according to claim 10 wherein at least 
one of [P 1 ] to [P 11 ] has the fomula (V) ; 

N c 

- [U a ] - [U a ] - [UJ - [U 5 ] - [U 6 ] - [U v ] - [U e ] - [U 9 ] 7j . .. (V) 
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wherein : 

- U is as previously defined, 

* - IUJ to [U 6 ] are chosen from KF-methylpyrrole (Py) , N- 
methylimidazole (Im) and a P alanine residue, 

- With the proviso that the [U] immediately adjacent to 
each Im on the N- terminal aide is a p alanine residue, 

[U 9 ] may be present or absent, and if present is 
preferentially p-alanine, 

and a dotted line represents a covalent bond which can be 
present or absent . 



20. DNA-binding molecule according to claim 19 wherein at least 
one of [P 1 ] to [P 71 ] has the formula (VI) : 



N c 

—Urn- P- Im- Py ^p- Xm-p- Im- 0 J._ . 



(VI) 



21. DNA-binding molecule according to any. one of claims 9 to 20 
wherein x represents a value from 2 to io # for example 2, 3, 
4, 5, 6, 7, 8, 9, or 10. 

22, DNA-binding molecule according to claim 7 having the general 
formula (VII) : 



N 



(VII) 



LR 1 ] 



[D] f — [B] g — [P 1 ] — [T 1 ] 



CD, 



[R n ] 



[P°] — [T n ]- 



[Zh 



NT 
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wherein [R 1 ] , [P] , [r»] , [P »] , [T ^] , [T *] # (L) / [D] ? [B] # 
[Z] , m, n, g, f and e have the previously defined meanings. 

23. DNA-binding molecule according to claim 22 wherein each of 
the DNA-binding elements to [P n ] and [r 1 ] to [R n J 
independently have the general formula (VIII) 

-....[u 1 - [u].-3 — (viii) 

wherein : 

« each U is independently a monomeric unit chosen from a 
heterocyclic amino acid residue, or an aliphatic amino acid 
residue or a fluorescent derivative of the foregoing, and 
s is an integer from. 0 to 15, preferably from 1 to 6 
and a dotted line represents a covalent bond which can be 
present or absent* 

24. DNA-binding molecule according to claim 23 wherein at least 
one heterocyclic amino acid residue comprises an annular 
nitrogen. 

25 . DNA-binding molecule according to claim 24 wherein at least 
one of [P 1 ] to [P" 1 ] or [R 1 ] to [R n ] contain a residue of N- 
methylpyrrole (Py) and /or 3 -hydroxy N-methylpyrrole (HP) 
and / or N-me thy 1 imidazole (Im) . 

26. DNA-binding molecule according to claim 25 wherein at least 
one of [p 1 ] to [P n ] or [R l ] to [R n ] contain an aliphatic 
amino-acid residue such as a p-alanine (p) residue. 
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27, DNA-binding molecule according to claim 23 or 24 wherein S 
is an integer from 2 to 6. 

28 5 DNA-binding molecule according to cl^im 23 or 24 wherein at 
least one of [P 1 ] to [P n ] or [R 1 ] to iR A ] comprises at least 
four heterocyclic amino acid residues. 

29. DNA-binding molecule according to claim 23 wherein at least 
one of IP 1 } to [P* 1 ] or [R 1 ] to [R*] comprises more than two 
contiguous heterocyclic amino acid residues, for example 
three, four or five contiguous heterocyclic amino acid 
residues „ 

30 DNA-binding molecule according to claim 29 wherein stretches 
of three to five contiguous heterocyclic amino acict residues 
are separated from each other by a p-alanine residue. 

31. DNA-binding molecule according to claim 23 wherein at least 
8 one [P tt ] element has the formula (IX) : 

N C 
t-[Ui] [U fl . x ] - [U.J.J-- (IX) 

and at least one [R n ] element has the formula (X) : 



N C 

[ [U q ] [U 2 ] - [UiH (X) 

wherein each U represents independently N-methylpyrrole 
(Py),or 3 -hydroxy N-methylpyrrole (HP), or N-methyl imidazole 
(Im) or N-methyl pyrazole (Pz) , or 3 -pyrazolecar boxy lie acid 



c 
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(3-Pz), or p-alanine (P) , q and s are independently integers 
from 1 to 10, 

and a dotted line represents a covalent bond which can be 
present or absent, 

wherein the U residues of [P*] form anti-parallel pairs with 
the u residues of [R a J : 



said pairs being chosen from Py/lm, Im/Py, Py/Py, Hp/Py, 
Py/Hp, p/Py, Py/pr P/Ini, Im/Pi lm/lm, Pz/Py, 3-Pz/Pz, and 
P/P- 



DNA-binding molecule according to claim 9 wherein at least 
one DNA-binding element contains [T] , [R] and [P] moieties , 
and at least one DNA binding element is free of [T] and [R] 
moieties. 

DNA-binding molecule according to claim 9 wherein the 
multiple [RJ and [P] elements are different in length and / 
or composition, 

DNA-binding molecule according to any one of the preceding 
claims 7 to 34 having the capacity to bind in a multidentate 
mode to a given strand of DNA. 



N 



C 




[T] ~ 




C 



DNA-binding molecule according to any one of the preceding 
claims 7 to 3 5 wherein m represents a value greater than or 
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equal to one, and the amphipathic linker (L) m comprises an 
assembly of linker sub-units (L) r 

DNA-binding molecule according according to claim 36 wherein 
the assembled linker (L) m is heterobi functional . 

DNA-binding molecule according to claim 36 wherein each 
linker sub-unit (L) is heterobif unctional . 

DNA-binding molecule according to claim 36, wherein at least 
one (I*} sub-unit is amphipathic. 

DNA-binding molecule according to claim 37 wherein the total 
length of the linker (l) b is between 5 to 250 Angstroms, for 
example 5 to 50 Angstroms. 

DNA-binding molecule according to claim 37 wherein the 
functional groups are chosen from amino, carboxyl, thiol, 
haloacetyl, aldehyde, amino-oxy, maleimide groups, a 
symetrical anhydride and halogen atoms. 

DNA-binding molecule according to claim 36 wherein at least 
one amphipathic linker (L) sub-unit comprises one or more 
ether groups and/or ester groups for example molecules 
derived from ethylene oxide or propylene oxide. 

DNA-binding molecule according to claim 36 wherein at least 
one amphipathic* linker (L) sub-unit comprises one or more 
units of 8-amino-3, 6-dioxaoctanoic acid (Ac) . 

DNA-binding molecule according to any one of claims 1 to 43 
having the capacity to bind in a sequence specific manner to 
a DNA recognition sequence of at least 6, preferably at 
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« 

least 10 and most preferably at least 14 base pairs in 
length. 

45- DNA-binding molecule according to any one of claims 1 to 44 
having a molecular weight no greater than approximately S 
kDa. * 

46- DNA-binding molecule according to any one of claims 1 to 45 
wherein the said molecule binds to the DNA minor groove. 

47. DNA-binding molecule according to any one of claims 1 to 46 
which is cell -permeable. 

48. DNA-binding molecule according to any one of claims 1 to 47 
« * 

having an apparent binding affinity of at least 5 x 10 1 M' 1 . 

49. DNA-binding molecule according to any one of claims 1 to 48 
having an apparent binding affinity of at least 1 x 10 5 M" 1 . 

50. DNA-binding molecule according to any one of claims 1 to 49 
having an apparent binding affinity of at least 5 x 10 10 M" 1 . 

51. Process for binding double-stranded DNA in a sequence- 
specific manner, comprising contacting a DNA-target sequence 
within said DNA with a DNA-binding molecule according to any 
one of claims 1 to 50, in conditions allowing said binding 
to occur. 

52. Process according to claim 51 which is carried out in vivo, 
• in vitro or ex vivo. 

53 . Process according to claim 52 which is carried out in a 
cell. 
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54. Process according to claim 53, wherein said cell ia 
eukaryotic - 

55. Process according to claim 53 , wherein said cell is 
prokaryotic. 

# 

56. Process according to claim 54, wherein said cell is a 
vertebrate cell, an invertebrate cell, a plant cell 

57. Process according to claira54 / wherein said cell is a 
mammalian cell, an insect cell, or a yeast cell. 

58. Process according to any one of claims 51 to 57 wherein the 
double stranded DNA is endogenous to said cell. 

59* Process according to any one of claims 51 to 57 wherein the 
double stranded DNA is heterologous to said cell. 

60. Process according to claim 53 wherein the double stranded 
DNA target sequence comprises a chromatin element . 

61. Process according to claim 60 wherein the target sequence 
comprises a SAR-lik.e sequence. 

62. Process according to claim 60 wherein the target sequence 
comprises a GAGAA repeat sequence. 

63 . Process according to any one of claims 50 to 62 wherein the 
target sequence has at least 8 and preferably at least 15 
bases . 
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64. * Process according to claim 60 wherein the target sequence is 

a cia- or trans -acting element mediating chromosome 
function. 

65. Process according to claim 64 wherein the binding of the 
target element with the sequence-specific binding molecule 
givee rise to cia- and / or trans -regulation of chromosome 
function, 

66. Process according to claim 53 wherein the double stranded 
DNA target sequence comprises a site mediating the activity 
of one or more regulatory factors _ 

67 . Process according to claim 66 wherein the regulatory factors 
is a transcription regulatory factor, a DNA replication 

, factor, a factor for enzymatic activity, a factor involved 
in chromosome stability. 

66. Process according to .any one of claims 51 to 67, wherein the 
DNA-binding molecule is linked to an effector moiety. 

69. Process for modulating chromosome function in a eukaryotic 
cell, 

comprising the step of contacting a genomic DNA element, 
comprising a binding site mediating chromosome function, 
with a molecule according to any one of claims 1 to 50 and 
having the capacity to bind in a sequence- specific manner to 
said element, 

Baid step of contacting being carried out in conditions 
permitting binding of said compound to -said element, 
wherein the binding modulates chromosome function. 

« 
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70. Process for modulating the function of a DNA element in a 
eukaryotic cell. 

Comprising the step of contacting a genomic DNA element, so- 
called « chromatin responsive element » (CRE) , 
with a molecule according to any one of claims 1 to 50 and 
having the capacity to bind in a sequence -specif ic manner to 
said CRE, 

said step of contacting being carried out in conditions 
* permitting chromatin remodeling of the CRE by said compound, 
wherein said chromatin remodeling of the CRE alters the 
activity of one or more other DNA elements, so called 
« modulated DNA elements * in the genome. 

H- Cell containing a compound according to any one of claims l 
to 50, 

'2. Cell according to claim 71, wherein said compound binds the 
DNA-minor groove, 

3. Cell according to claim 71 which is a eukaryotic cell. 

4. Non-human organism comprising a cell according to claim 71. 

5. t Organism according to claim 74 which is a non-human animal. 

6. Organism according to claim 75 which* is a transgenic, non- 
human animal * 

7. Organism according to claim 74 which is a plant. 

8. Organism according to claim 77 which is a transgenic plant. 
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79. Pharmaceutical composition comprising a compound according 
* 

to any one of claims 1 to 50 in association with a 
physiologically acceptable excipient. 

80. Compound according to any one of claims I to 50, for use in 
therapy. 

81* Compound according to any one of claims 1 to 50 which ie 
fluorescent or fluorescent ly labelled, 

82. Compound according to claim 81 wherein the fluorescent label 
is a fluorescent dye such as fluorescein, dansyl, Texas red, 
isosulfan blue, ethyl red, malachite green, rhodamine and 
cyanine dyes. 

83^. Use of a compound according to claim 81 for probing the 
epigenetic state and location of DNA in chromosomes and 
nuclei * 

84. Use according to claim B3 for diagnosis of pathological 
conditions arising from epigenetic status. 

85. Use of a compound according to claim 81 for chromosome 
visualisation and marking in diagnosis, forensic studies, 
affiliation studies, or animal husbandry. 



